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Abstract 


Some  asymptotic  results  for  a  kernel  type  estimator  of  the  quantile 

function  from  right-censored  data  are  obtained.  The  estimator  is  defined 
—1  1  * 

hy  Qj/p)  "  /q  Q^t^CCt-pJ/h^dt,  which  is  smoother  than  the  usual 

A  A  A 

product-limit  quantile  function  Q  (p)  ■  inf{t:  P  (t)  £  p},  where  F  denotes 

n  n  n 

the  product-limit  estimator  of  the  lifetime  distribution  Fq.  Under  the  random 
censorship  model  and  general  conditions  on  h^.K,  and  Fq,  the  asymptotic 
normality  of  Qn(p)  is  proven.  In  addition,  an  approximation  to  is 

shown  to  be  asymptotically  uniformly  equivalent  to  in  mean  square. 
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1.  Introduction 

\) 

In  reliability  and  medical  studies,  It  Is  often  of  interest  to  estimate 
various  quantiles  of  the  unknown  lifetime  distribution.  In  particular,  the 
median  lifetime  and  extreme  quantiles  are  of  interest  to  the  experimenter  In 
such  studies.  In  many  life  testing  and  medical  follow-up  experiments,  however, 
arbitrarily  right-censored  data  arise,  and  it  is  important  to  be  able  to  esti¬ 
mate  the  quantiles  of  interest  based  on  the  censored  data.  For  such  data, 
some  kernel-type  quantile  estimators  are  considered  in  this  paper  which  give 
smoother  estimates  than  the  usual  product-limit  quantile  function.  '  >  ’ 

For  any  probability  distribution  function  G,  denote  the  quantile  function 
by  QCp)  -  G  *(p)  ■*  inf{x:  G(x)  2  p},  0  £  p  £  1.  For  a  random  (uncensored) 

sample  Y^i...,Y  from  G,  the  sample  quantile  function  Gn*(p)  “  inf{x:  Gn(x)  £  p} , 
0  £  p  5  1,  has  been  used  to  estimate  Q(p) ,  where  G^  denotes  the  sample 
distribution  function.  CsBrg^f  (1983)  gave  many  of  the  known  results  concerning 
Gn*(p).  Also,  Falk  (1984)  studied  the  relative  deficiency  of  the  sample  quantile 
with  respect  to  kernel-type  estimators,  and  Falk  (1985)  obtained  asymptotic 
normality  for  kernel  estimators.  Yang  (1985)  has  obtained  some  convergence 
properties  of  kernel  estimators  of  Q(p)  and  gave  some  simulation  results  compar¬ 
ing  kernel-type  estimators  with  other  estimators.  For  arbitrarily  right-censored 
data,  Sander  (1975)  proposed  estimation  of  Q(p)  by  the  quantile  function  of  the 
product-limit  estimator,  and  she  and  Cheng  (1981)  derived  some  asymptotic 
properties  of  that  estimator.  Also,  CsQrgff  (1983)  presented  strong  approximation 
results  for  that  estimator. 

Recently,  Padgett  (1985)  studied  a  smoothed  nonparametric  estimator  of 
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Q(p)  from  arbitrarily  right-censored  data  based  on  the  kernel  method.  It  was 
shown  that  his  estimator,  mentioned  briefly  by  Parzen  (1979,  p.  119),  was 
strongly  consistent,  and  a  small  Monte  Carlo  study  was  performed  to  compare 
the  estimator  with  the  product-limit  estimator.  In  addition,  a  simple  approxima¬ 
tion  to  this  kernel  estimator  was  shown  to  be  almost  surely  asymptotically 
equivalent  to  it. 

The  purpose  of  this  paper  is  to  further  study  the  asymptotic  properties 
of  the  estimators  proposed  by  Padgett  (1985).  In  particular,  the  asymptotic 
normality  will  be  proven,  and  the  asymptotic  equivalence  in  mean  square  of  the 
estimator  and  its  approximation  will  be  shown  under  general  conditions  on  the 
kernel  function,  bandwidth  sequence,  lifetime  distribution,  and  censoring 
mechanism. 

2.  Notation  and  Preliminaries 

Let  X°,X°,...,X°  denote  the  true  survival  times  of  n  items  or 
1  t  n 

individuals  which  are  censored  on  the  right  by  a  sequence  .U^, . . . ,Un> 

which  in  general  may  be  either  constants  or  random  variables.  It  is  assumed 

that  the  X°* s  are  nonnegative  independent  identically  distributed  random 

variables  with  common  unknown  distribution  function  F  and  unknown  quantile 

o 

function 

Q°(p)  =  “  inf{t:  FQ(t)  2p>,  OSpH. 

The  observed  right-censored  data  are  denoted  by  the  pairs  (Xj,,A^),  i«l,...,n, 

!1  if  X°  S  U± 

0  if  X°  >  U±  . 


where 


For  the  asymptotic  results  of  this  paper,  the  random  right-censorship  model 


will  be  assumed,  that  is,  U^,...,Uq  constitute  a  random  sample  from  a  distri¬ 
bution  H  (usually  unknown)  and  are  independent  of  X^,...,X°.  The  distribution 
function  of  each  X^,  i“l,...,n,  is  then  F  *  1  -  (l-Fo)(l-H). 

A  popular  estimator  of  the  survival  function  SQ(t)  ■  1  -  FQ(t)  based  on 
CXi.Ai>.  i«l,...,n,  is  the  product-limit  estimator,  proposed  by  Kaplan  and 
Meier  (1958)  as  the  "nonparametric  maximum  likelihood  estimator."  Let 
(Z^A’),  i=l,...,n,  denote  the  ordered  X^'s  along  with  their  corresponding 
A^'s.  The  product-limit  estimator  of  SQ(t),  shown  to  be  "self-consistent" 
by  Efron  (1967) ,  is  defined  by 


Pn(t) 


1, 

k-1  .  A’ 

n-i+1' 

0, 


0  <  t  S  Z  t 


\.x  K  t  -  k=2,...,n 


t  >  Z  . 
n 


Denote  the  product-limit  estimator  of  F  (t)  by  F  (t)  =  1  -  P  (t) ,  and  let 

on  n 

A 

Sj  denote  the  jump  of  P^  at  Z^ ,  that  is 


| 

11  -  W’ 

3  -  1 

\  *  A 

8j " 

1 

i 

W 

j  =  2 , . . . ,n-l 

1 

P„«_)  ■ 

3  -  ti. 

^  n  n 

Note  that 

V 

0  if  and  only  if  A.J 

-  0,  3  <  n. 

censored  observation.  Also,  denote 

si  E  Vzi+i 

3-1  J 

Sq  =0,  Zq  =  0,  and  Zn+^  =  Zq  +  e,  for  some  positive  constant  e. 

It  is  natural  to  estimate  by  the  product-limit  quantile  function 

Q  (p)  s  (°  *  inf{t:  F  (t)  k  p).  Sander  (1975)  and  Cheng  (1981)  obtained 
n  p  n 


asymptotic  normality  results  for  Padgett  (1985)  smoothed  Qq  by  the 

kernel  method  to  obtain  the  estimator 

Q^p)  -  h"1  /J  Qn(t)K((t-p)/hn)dt 


-1  11  ^1 

h  '  I  Z.  /  1  K((t-p)/h  )dt, 

“  1-1  1  Si-1  n 


(2.1) 


where  K  is  an  appropriate  kernel  function  and  {hQ}  is  a  bandwidth  sequence. 
Also,  a  simpler  kernel-type  estimator  which  is  an  approximation  to  (2.1)  was 
defined  by 


Q*(P)  *  h"1  Z  Z  s  K((S.  -  p)/h  ). 
n  n  .  ^  il  l  n 


(2.2) 


Note  that  only  the  uncensored  observations  actually  appear  in  the  sums  of  (2.1) 
and  (2.2). 

In  the  next  section,  the  asymptotic  normality  of  Qq(p)  and  Qn(p)  will 
be  obtained.  The  following  general  conditions  on  the  kernel  function,  the 
bandwidth  sequence,  and  the  lifetime  and  censoring  distributions  will  be  assumed: 

(h.l)  h^  0  as  n  -*■  «; 

(K.l)  K(x)  is  a  bounded  probability  density  function  which  has  finite 
support,  i.e.  K(x)  -  0  for  |x|  >  c  for  some  c  >  0; 

(K.2)  K  is  symmetric  about  zero; 

(K.3)  K  satisfies  a  Llpschitz  condition,  i.e.  there  exist  a  constant 
T  such  that  for  all  x,y, 

|K(x)  -  K(y) 1  s  r|x  -  y| ; 

(F.l)  F  is  continuous  with  density  function  f  ;  and 
o  o 


(F.2)  H(T_  )  <  1,  where  T_  -  sup{t:  F^(t)  <  1} . 
Fo  Fo  ° 
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It  should  be  noted  that  these  conditions  are  not  prohibitive  and  (F.l)  and 
(F.2)  are  similar  to  conditions  required  by  Cheng  (1981).  Also,  (F.2)  insures 
that  observations  will  be  available  from  the  entire  support  of  F^,  a  common 
condition  in  random  right-censorship  models. 

3.  The  Main  Results 

In  this  section,  the  main  results  are  summarized  in  Theorems  3.1  and  3.2. 

The  proofs  will  be  presented  in  the  next  section.  Theorem  3.1  gives  conditions 

for  the  asymptotic  normality  of  Qn(p).  The  asymptotic  uniform  mean-squared 

* 

equivalence  of  and  will  be  shown  in  Theorem  3.2. 

Theorem  3.1.  Assume,  in  addition  to  conditions  (h.l),  (K.l)  -  (K.3),  (F.l), 

and  (F.2),  that  the  derivative  f'  is  continuous  at  E°  and  f  (E°)  >  0. 

o  pop 

Suppose  {h^}  is  such  that  n*^hQ  0  as  n  -*■  “.  Then  for  0  <  p  <  T, 
where  T  <  1,  as  n  -*■  ®,  */n[Qn(p)  -  Q°(p)]  -*•  Z  in  distribution,  where  Z 
is  a  normally  distributed  random  variable  with  mean  zero  and  variance 

2  2  C  -2  dFn<u> 

°Z  «  (1  -  p)  /0P  [1  -  F(u)]  -=2-— 

0  f2  o 

o  p 

with  1  -  F(u)  ■  [1  -  F  (u)][l  -  H(u)]  and  F  (u)  ■  P(X.  £  u,  A.  ■  1) ,  the 

o  oil" 

subdistribution  function  of  the  uncensored  observations. 

Note  that  an  example  of  a  bandwidth  sequence  that  satisfies  the  conditions 

of  Theorem  3.1  is  h  ■  cn  ®  with  6  >  1/A. 

n 

* 

The  next  theorem  gives  some  conditions  for  which  Qq  and  are 

asymptotically  uniformly  equivalent  in  mean  square. 
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Theorem  3.2.  Suppose  Fq  and  H  are  continuous  and  (h.l),  (K.l),  and  (K.3) 

hold.  Assume  E(X2<1)  <  00  for  some  q  >  1,  where  *  mintX^,^}.  Let  n 

be  such  that  [1-F  (n) ) [l-H(n) 1  >  0  and  let  T*  -  F  (u) .  Then  for  all 
o  o 

T  e  [0,T*>,  lim  E[  sup  lQ*(p)  -  Q  (p)|2]  ■  0,  provided  n1//2h*  «°  as 

n-*»  0<psT  ^  n 

n 

4.  Proofs  of  Theorems 

The  following  two  lemmas  will  be  needed  in  the  proof  of  Theorem  3.1. 

In  this  section,  (K(s,t):  0  <.  s  £  1,  t  2:  0}  will  denote  the  generalized 
Kiefer  process  as  stated  by  Csorgo*  (1983,  Ch.  8). 

Lemma  1.  For  0  <  p  <  T,  where  T  <  1,  and  6  <  min{T  -  p,p},  as 
n  -*•  “ 

sup  |n  ^2[K(p+h,n)  -  K(p,n)]|  -*•  0  in  probability. 

|  h  1  <6 

The  proof  of  Lemma  1  follows  the  same  argument  as  the  proof  of  Theorem  8.2.1 
of  Csorgo  (1983) . 

Lemma  2.  Suppose  the  derivative  f'  is  continuous  at  £°  and  f  (£°)  >  0. 
-  o  p  o  p 

Under  assumptions  (h.l),  (K.l),  (K.2),  (F.l)  and  (F.2),  for  0  <  p  <  1, 

| /q  UQ(t)  -  qnCp)  l*1”1  K(~£)dt|  -*■  0  in  probability 

n 

1  /2  A  - 

*  n  t)  -  Q  (t)]  denotes  the  product-limit 

quantile  process. 


as  n  -+ 


where 


v° 


Proof .  For  any  given  6  >  0,  there  exists  N  such  that  when  n  2:  N, 


(4.1) 


I'J  IV^  -  Vp)!  K1 

n 

■  I  W»(t>  - C  K^dtl 

n 

*  sup  |q  (t)  -  q  (p) | > 
teA(S)  n  n 

where  A(6)  *  [p-6,  p+6].  By  the  conditions  on  fQ,  for  6  sufficiently 
small,  fQ(Q°(t))  >  0  for  all  t  e  A(6).  Hence,  the  right-hand-side  of  (4.1) 
is  less  than  or  equal  to 


sup  |B  (t)  -  8  (p)|»| - - - 

teA(6)  n  n  f  (Q°(t)) 


+  sup 
tcA(6) 


kn(p)[ 


1 

fo(Q°(t)) 


f0(Qu(p)) 


-]| 


where  ?n(t)  -  f0(Q°(t) )qn(t) . 


Let 


a  ■  sup 


and  b 


sup 


teA(6)  fo(Q  (t)) 


teA(6)  f  (Q°(t))  f  (Q°(p)) 
o  o 


From  Corollary  1  of  Cheng  (1981),  since  fQ  is  continuous  at  £*n(p)  -*■  Z 
in  distribution  as  n  -*■  “,  where  Z  is  a  normally  distributed  random  variable 

r° 

2  2  ’n  — 2  * 

with  mean  zero  and  variance  a  -  (1-p)  Iqp  [l-F(u)]  dFQ(u), 

with  l-F(u)  -  [1-F  (u) ] [l-H(u) ]  and  F*(u)  -  P(X.  S  u,  A  -  1) .  Therefore, 

o  oil 


sup  |a  (p)[ - ~ 


tcA(6) 


f0(Q°(t))  f0(Q°(P)) 


~ — ]|  *  b|a  (p)| 


(4.2) 


and  for  given  e  >  0 
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For  small  enough  6,  p  +  6<T<l  and  p  -  6  2  0,  so  that  by  Corollary  8.3.3 
of  Csorgo  (1983)  as  n  -»■  “ 

sup  |d  (t)  -  n  K(t,n)|  -»■  0  in  probability 

t£A(5)  n 

and 

/  2 

sup  |d  (p)  -  n  K(p,n)J  ■+■  0  in  probability. 

teA(6)  n 


By  Lemma  8.2.1  of  Csorgo  (1983)  (or  Berkes  and  Philipp,  1977)  with 

-1/2 

0  *  tQ  <  tj^  *  n,  letting  B(s)  5  n  K(s,n),  0  £  8  £  1, 

sup  J B(p)  -  B(t) |  =  sup  |B(p+h)  -  B(p)|, 
teA(6)  |h| <6 

which  by  Lemma  1  converges  to  zero  in  probability  for  sufficiently  small  6. 

Therefore,  (4.4)  converges  to  zero  in  probability  as  n  -*•  «*>. 

Finally,  since  b  depends  on  6,  letting  b  become  arbitrarily  small 

gives  from  (4.2)  and  (4.4)  that 
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sup  |£  (p)[ - - - - - ])  ->-0  in  probability. 

teA(6)  n  f  (Q°(t))  f  (Q°(p)) 
o  o 

Thus,  the  result  follows.  /// 

Proof  of  Theorem  3.1.  Analogous  to  the  beginning  of  the  proof  of 
Theorem  1  of  Yang  (1985) ,  write 

^  IQn(p)  -  Q°(p)3  =  fo [qn(t)  -  q^pm"1  K(1?)dt 

n 

+  n1/2[/jQ°(t)h^  K(^)dt  -  Q° (p)  1  +  qn(p),  (4.5) 

n 

1  /O  A  Q 

where  q^(t)  =  n  [Qn(t)  ~  ^  (t)]  as  in  Lemma  2. 

From  Lemma  2,  the  first  term  on  the  right  hand  side  of  (4.5)  is  o^(l), 
which  means  that  it  converges  to  zero  in  probability  as  n  -*•  °®.  Similar  to 
Yang's  (1985) equation  (10), 

n1/2[/Jq°(t)h“1  K(-^)dt  -  Q°(p)  ] 

n 

=  o(n1/2h2)  +  n1/2h2  Q°"(p)  r  -~K(t)dt.  (4.6) 

n  n  £ 

With  the  assumption  that  n^^h  -*■  0  as  n  (4.6)  is  also  °  (1) . 

n  P 

Therefore,  by  Corollary  1  of  Cheng  (1981),  the  conclusion  of  the  theorem 
follows.  Ill 


Proof  of  Theorem  3.2. 
Q*(p)  -  Qn<P)  ■ 


For 

n 

I 

i*=l 


0  <  p  £  T,  write 

S  — p  s 

V8i  K<-h>  -  's< ,  K(if)dc1' 

n  i-1  n 


When  s^  >  0,  that  is,  is  uncensored,  let  be  an  interior  point  of 

the  interval  (S^  with  probability  one  so  that 


s . K((S.-p)/h  )  ■  /_  K((t-p)/h  )dt  almost  surely. 

11  n  1  n 

Then  by  condition  (K.3),  letting  1^  denote  the  indicator  function  of  the 


set  A, 


|Q>>  -  V)lI[0,T]('’>  s  C  ^  -AlMWi-PJ/b,,) 


-  K«VP>/hn)|  VT)(p)  IIsSch 


i  n’ 


-  r  hn  ZisilSi”SilI[0,T](p)  I[£l*1-chn,l](p) 

S  r  C  ^  Vi  II0.T)<l>)  I(Sj-chn.l](''>  al"°St  SUrely- 


lO>  -  V>>l2  iI0.T]<rt  £  r2C«  Vi  Vf-ch  ,il(p))2-  <4-7) 

1  n 


a  Zl8l  I[0iT](p)  I[S*-ch  ,l](p)) 

l  n 

<  z  z±s±  I[0>T](p)  I[sJ-chn,l](p) 


*  Tz^dw  -  f^vI+IV^-V2?!)3  l[sl.2-chn,i]<',)-  (4-8> 


where  g(x  )  denotes  the  limit  from  the  left  at  x  of  the  function  g. 

* 

There  is  an  N  such  that  for  all  n  2  N,  T  +  ch  <  T  and 

n 

T  <  F  (n-e)  where  T„  =  T  +  ch^.  For  all  such  n’s 


1  z;.*ivv  -  vvi  +  ivv  -  vvi>3  i[o,ii<">iB1.2-chii>i)<'» 


(4.9) 


S  8  E  zj  ,  sup.  |F  (x)  -  F  (x)|3, 

1-1  OixSQ  (T„) 
n  n 


which  is  independent  of  p.  Notice  also  that 

sup  |F  (x)-F  (x)|3  £  sup  (F  (x)  -  F  (x) | 3  +  Ifft  .  .  ,• 

0 (T  )+e  n  °  O^xSn  n  °  [VV>n'el 

n  N 

From  the  exponential  bound  in  Theorem  2  of  Fttldes  and  RejtO  (1981),  for  any 

1/2  A  y» 

r  2  1,  {n  sup  |F  (x)  -  F  (x) | )  ,  n  £  1}  is  uniformly  integrable.  Also 
O^x^n 


E(n3/2  I..  ,  .  ,)r 

fQn(V>n_eJ 


n3r/2P[Qr(TN)  >  n-e] 


n3r/2pivv  -  f;1<tn>  > "  -  ^v-1' 


Letting  y  ■  n  -  F_X(T  )  -  e,  where  e  is  chosen  so  that  y  >  0, 
o  N 


?tW  -  Fo  <V  >  Y] 


s  P[TN  >  Fn(n-e)I 


P[T„  -  F  (n-e)  >  F  (n-e)  -  F  (n-e);) 
no  n  0 


S  P[|f  (n-e)  -  F  (n-e) |  >  F  (n-e)  -  T  ], 

n  q  on 
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By  the  same  exponential  bound  as  in  Theorem  2  of  Foldes  and  Rejto  (1981), 
as  n  -*■  • 


e<”3/2  It^cTN»n-e]>r  -  °- 


Therefore,  {(n1^2  sup  f F  (x)-F  (x) | )r,  nil)  Is  uniformly  integrable. 

0£x^(TN)+e  n  ° 

By  hypothesis,  E(xJq)  <  »  for  some  q  >  1,  so  { (n-1  E  zj)q,  nil}  is 

1  1 

uniformly  integrable.  Thus, 


E(n3/2  sup  |F  (x)-F  (x)|3  n"1  E  Z2) 

O^xSQ  (T  )+e  n  °  1 

n  n 


(E(n3^2  sup 


|F  (x)-F  (x) |  )  ) 
O^x^d^e  n  ° 


S'1/S  E((n-1 


E'  Z2)q)^^ q 
1  1 


0(1). 


(4.10) 


Therefore,  from  (4.7)  -  (4.10),  by  the  hypothesis  n3^2h^  •*  ®  , 


E[  sup  |Q  (p)  -  Q_(p)r]  -  o(l) , 
OiSpST  n  n 


completing  the  proof.  /// 
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